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Executive  Summary 


The  focus  of  this  report  is  on  major  improvement  and  system  test.  Receiver  sensitivity  has  been  significantly  im¬ 
proved  and  about  3  dB  time  reversal  precoding  gain  has  been  observed  based  on  system  trial  in  indoor  non-line-of- 
sight  scenario.  Theoretical  study  shows  that  additional  gain  can  be  achieved  by  using  optimum  waveform  design. 
Listed  below  are  major  achievements  during  the  quarter: 


•  multiple  integration  window  sizes; 


•  comprehensive  system  test; 

•  improved  receiver  sensitivity  to  -81  dBm; 

•  3  to  4  dB  time  reversal  precoding  gain  in  indoor  non-line-of-sight  scenario. 

We  will  continue  to  improve  the  system  and  conduct  more  measurements,  while  seeking  some  technology  transfer 
opportunities. 
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CHAPTER 1.  MAJOR  IMPROVEMENTS 


Our  major  development  work  in  the  last  quarter  is  to  test  and  modify  the  system.  The  most  significant  part  is  the 
synchronization  at  the  receiver  side. 


1.1  Synchronization  Improvements 

For  a  burst  mode  radio  system,  the  synchronization  has  always  been  a  challenge.  In  order  to  make  the  synchro¬ 
nization  more  robust  for  the  test-bed,  we  have  made  some  improvements,  which  include  modification  of  the  frame 
structure,  average  samples  for  start  pattern  searching  and  various  thresholds  adjustment. 


1.1.1  Synchronization  Review 

Before  talking  about  these  synchronization  improvements,  let’s  first  review  the  receiver’s  implementation  structure 
and  data  flow.  The  functional  diagram  is  shown  as  Fig.  1.1.  The  main  modules  include  data  interface  between  FPGA 
and  ADC,  integration,  synchronization,  decision,  a  finite  state  machine  and  thresholds  control.  Among  them,  the 
challenges  at  the  receiver  are  mainly  on  the  high  speed  interface,  fast  integration  and  the  robust  synchronization.  We 
are  currently  using  a  3  stage  synchronization  technique  which  is  shown  in  Fig.  1.3. 


Figure  1.1:  Receiver  functional  diagram 

Fig.  1.2  is  the  data  flow  for  the  receiver.  First  the  interface  module  converts  the  high  speed  ADC  output  data  to 
relatively  low  speed  data  streams  through  several  steps.  Then  the  data  streams  go  to  start  pattern  searching  module 
to  find  the  time  of  arrival  and  go  to  integration  module  for  chip  level  energy  integration.  There  are  32  fast  integrators 
for  integration,  which  are  implemented  by  DSP  cores  embedded  in  Virtex-5  FPGA.  The  synchronization  module 
combines  the  time  of  arrival  signal  and  the  integration  results  by  finding  the  integrator  with  maximum  energy,  then 


LI.  SYNCHRONIZATION  IMPROVEMENTS 


5 


it  gets  the  chip  threshold.  After  that  the  receiver  applies  frame  level  synchronization  to  find  the  fine  timing  and  make 
decisions.  The  finite  state  machine  and  variable  thresholds  control  module  coordinate  all  the  processes. 
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Figure  1.2:  Receiver  data  flow 


1.1.2  Frame  Structure 

The  system  is  supposed  to  find  the  window  with  maximum  energy  in  every  40  ns,  which  is  the  chip  period.  Fig. 
1.3  shows  the  new  frame  structure  of  the  system.  The  first  stage  is  to  search  the  time  of  arrival;  the  second  stage 
is  to  average  128  chips  energy  to  find  the  chip  threshold;  the  third  stage  is  to  synchronize  the  whole  frame.  Data 
demodulation  is  performed  after  the  three-stage  synchronization.  Fig.  1.3  also  shows  the  sliding  window  integration 
diagram  and  the  state  transition  diagram. 

The  modification  in  the  synchronization  head  does  not  change  the  overall  frame  structure.  The  following  parameters 
are  kept:  system’s  frame  length:  4096  chips  or  163.84  /is;  payload  efficiency:  94.49%;  chip  sync  accuracy:  one 
sample  (1.25  ns);  and  frame  sync  accuracy:  one  chip  (40  ns).  If  the  overall  clock  accuracy  is  20  PPM,  then  the 
maximum  time  drift  is  163.84  x  10~6  =  3.276  /is. 


1.1.3  Dense  Pulses 

However,  even  though  we  averaged  the  start  pattern  searching,  wc  observed  that  the  synchronization  error  still 
mainly  comes  from  the  first  stage.  In  this  stage,  the  threshold  is  very  sensitive  to  noise  level  and  DC  offset  of  ADC 
sampling,  especially  for  longer  distance  transmission.  So  we  try  to  send  5  chips  long  dense  pulses  at  the  beginning 
of  each  frame  to  make  sure  the  time  of  arrival  is  found  each  frame  since  the  average  value  of  these  chips  should  be 
much  higher  than  ordinary  ones.  Fig.  1 .4  shows  the  real  picture  of  dense  pulses  at  the  beginning  of  each  frame. 
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Find  Che  window  with  maximum  energy  in  40  ns  (Chip  level) 


Figure  1.3:  The  new  frame  structure  design 


Figure  1 .4:  The  picture  of  dense  pulse  at  the  beginning  of  each  frame 
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1.1.4  Start  Pattern  Averaging 

For  the  first  synchronization  stage,  we  accumulate  the  values  of  32  consecutive  samples  and  compare  it  with  a 
threshold  to  claim  whether  it  is  one  frame’s  time  of  arrival.  However,  due  to  the  interference  and  environment’s 
uncertainty,  the  first  stage  result  should  not  be  so  reliable  when  it  just  counts  on  the  value  only  one  chip  time, 
because  the  time  frame  is  too  short.  So  we  try  to  average  two  chip  time  samples  result  to  get  the  averaged  value,  and 
compare  it  with  a  threshold,  Fig.  1 .5  shows  the  diagram  of  this  improvement,  where  we  set  a  more  strict  condition  to 
claim  the  time  of  arrival  result  by  comparing  5  times(or  other  number)  instead  of  only  one,  and  is  in  5  chips  time(or 
other  number),  3  high  results  are  needed  to  claim  the  first  stage  synchronization  result.  After  that,  because  we  send 
dense  pulses  at  the  beginning  of  each  frame  and  there  is  a  interval  between  the  pulses  and  the  real  chips,  we  delay 
two  chips  time  before  the  beginning  of  second  stage. 


1.1.5  Other  modifications 

For  the  third  stage, we  also  utilized  60  bits  M  sequence  instead  of  60  bits  OOC  code  for  frame  synchronization.  This 
is  mainly  because  the  randomness  of  M  sequence  is  much  better  than  OOC  code. 

For  time  reversal  system  and  non  time  reversal  system,  the  received  pulses  are  very  different  at  the  width,  typically 
6  ns  for  time  reversal  and  15  ns  for  non  time  reversal.  So  in  the  energy  integration  module,  the  window  size  for 
integration  should  be  different  for  different  signal  scenarios.  This  parameter  is  to  be  adjusted  in  integration  module, 
as  described  in  the  last  quarter  report. 


1.2  Threshold  Adjustment 


For  energy  detection  based  receiver,  it  is  well  known  that  the  performance  depends  significantly  on  the  choice 
of  threshold  level.  A  good  threshold  can  be  determined  by  using  some  channel  quality  indicator  and  feedback 
information  provided  by  the  digital  processor  (back-end)  at  the  receiver.  There  is  an  optimum  threshold  for  lowest 
error  rate.  To  deal  with  ISI  situation,  it  is  proposed  to  set  a  threshold  based  on  two  worst  signal  cases. 

One  phenomenon  discovered  is  that  a  mono-bit  ADC  along  with  proper  thresholding  can  be  applied  for  quantizing 
the  pre-filters  coefficients,  and  the  resulting  signal  still  gets  focused.  It  enables  the  development  of  a  unique  time 
reversal  system  with  much  lower  complexity.  Major  points  include:  (1)  optimization  of  the  threshold  for  quantiza¬ 
tion;  (2)  development  of  an  adaptive  threshold  method;  (3)  impact  of  tap  spacing  and  length  of  the  pre-filter.  In  our 
case,  there  are  4  variable  thresholds  at  the  receiver  side  with  each  corresponds  to  different  stages  of  synchronization 
and  detection.  They  are  the  threshold  for  start  pattern  searching,  the  threshold  for  ADC  bias  shift,  the  code  distance 
for  frame  level  synchronization  and  the  threshold  of  chip  threshold  determine  at  the  second  synchronization  stage. 
All  these  values  are  adjusted  by  the  pushbuttons  on  the  receiver  Virtex-5  board  and  the  led  lights  can  show  any 
threshold’s  adjustment  level. 

DC  offset  causes  performance  degradation  in  signal  processing  systems  especially  for  high-speed  applications.  For 
our  ADC  board  in  the  receiver  side,  the  DC  shift  makes  the  true  quantization  value  hard  to  determine,  thus  makes 
the  threshold  vulnerable,  especially  for  the  energy  detection  based  receiver  with  OOK  modulation.  DC  offset  can  be 
reduced  in  real-time  by  subtracting  the  mean  amplitude  from  each  sample,  or  by  calibrating  the  ADC  board. 
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The  new  start  pattern  searching 


Figure  1 .5:  Start  pattern  searching  improvement 


Chapter  2 


FPGA  Implementation 
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CHAPTER  2.  FPGA  IMPLEMENTATION 


2.1  FPGA  Timing  Closure  Issues 


For  FPGA  implementation,  meeting  timing  requirements  in  speed-critical  designs  has  been  always  a  challenge,  and 
this  is  often  an  iterative  process  ensuring  that  each  and  every  path  in  the  design  meets  the  required  timing.  Meeting 
timing  closure  is  easy  and  automatic  for  relatively  slow  or  small  designs.  However,  most  designs  do  not  fall  into 
that  happy  category  and  as  each  critical  path  is  adjusted  in  order  to  meet  timing,  new  ones  are  uncovered  or  created 
anew.  In  the  case  of  FPGAs  implemented  at  the  65nm  technology  node,  for  example,  wire  delays  can  account  for 
80-to-90%  of  each  path’s  delay. 

An  important  technique  for  achieving  successful  timing  closure  on  an  aggressive  design  is  to  carefully  review  the 
most  critical  timing  constraints.  In  our  case,  there  are  two  critical  paths  at  the  transmitter:  one  is  from  the  data 
loading  module  to  the  waveform  generator  module  where  huge  parallel  data  steams  exist,  another  critical  path  is 
in  the  parallel  to  serial  conversion  module  requiring  high  clock  rate.  At  the  receiver  side,  the  most  critical  path 
is  from  the  integration  module  to  the  synchronization  module  which  performs  fast  timing  acquisition  in  parallel 
manner.  Achieving  clock-to-clock  (global)  timing  for  all  internal  signals  in  a  synchronous  design  may  be  easy, 
but  this  simple  approach  will  usually  overconstrain  the  design  and  eventually  lead  to  a  failure.  Therefore,  we  have 
applied  different  constraints  to  different  paths  depending  on  their  timing  requirements. 

The  impact  of  timing  constraints  can  be  seen  in  Fig.  2.1,  where  the  left  picture  shows  the  output  waveform  from 
the  DAC  when  appropriate  timing  constraints  are  applied,  while  the  right  one  shows  the  waveform  with  unwanted 
spikes  when  timing  constraints  are  not  properly  added. 


Figure  2. 1 :  Output  waveforms  comparison  with  and  without  applying  proper  timing  constraints 
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2.2  Major  FPGA  Implementation  Reports 

Table  2.1  shows  the  resource  usage  for  the  test-bed  back-ends  implemented  on  Xilinx  Virtex-5  LXT  FPGAs.  Note 
that  the  FPGA  chips  used  at  the  transmitter  and  receiver  are  of  the  same  model,  but  the  transmitter  FPGA  chip  has 
1 136  I/O  ports  while  the  receiver  FPGA  chip  has  only  665  I/O  ports.  As  we  can  see,  at  both  the  transmitter  and 
receiver  there  are  a  lot  of  FPGA  resources  available.  The  routing  result  is  shown  in  Fig,  2.2,  and  the  resource  usages 
are  visually  displayed. 


Figure  2.2:  FPGA  routing  results  for  both  transmitter  and  receiver 


12 


CHAPTER  2 .  FPGA  IMPLEMENTATION 


Table  2.1: 


FPGA  Implementation  statistics  at  transmitter  and  receiver 


Resources 

Transmitter 
Amount  used 

Transmitter 

Percent  used 

Receiver 
Amount  used 

Receiver 
Percent  used 

Number  of  Slice  Registers 

5314 

18% 

4184 

14% 

Number  of  Slice  LUTs 

5019 

13% 

3088 

10% 

Number  of  occupied  Slices 

1719 

24% 

1713 

23% 

Number  of  bonded  IOBs 

40 

8% 

90 

25% 

Number  of  BlockRAM/FIFO 

2 

3% 

1 

1% 

Number  of  BUFG/BUFGCTRLs 

7 

21% 

10 

31% 

Number  of  DSP48Es 

NA 

NA 

32 

66% 

Total  equivalent  gate  count 

275,214 

NA 

195438 

NA 

Chapter  3 


System  Test 

3.1  DAC  Interpolating  Filter  Measurement 


The  equivalent  baseband  filter  in  the  transmitter-receiver  chain  has  to  be  taken  into  account  in  order  to  have  optimum 
waveform  at  the  receiver  detector.  This  filter  is  generally  not  flat  over  the  frequency  band  of  interest,  especially  for 
wide  bandwidth  case.  Estimating  and  controlling  the  equivalent  baseband  filter  is  not  easy.  As  shown  in  Fig.  3.1,  the 
overall  filter  effect  is  contributed  from  three  sources:  the  DAC’s  interpolating  filter  p(t)  and  RF  front-ends  including 
antennas  at  both  sides.  By  omitting  some  scalar  factor,  the  equivalent  filter  in  baseband  format  is 

F{t)  =  F0(t)  ®  Fi(t)  ®  F2{t),  (3.1) 

and  Fo(t)  can  be  expressed  using  the  interpolating  filter  p(t) 

Fo(t)  =p(t)  +j  -p{t).  (3.2) 

According  to  our  measurement,  the  shape  of  the  equivalent  filter  is  dominated  by  the  interpolating  filter  p(t).  Based 
on  the  ADC  used  in  the  testbed,  p(t)  has  been  accurately  measured  and  Fo(t)  =  p(t)  4-  j  •  p(t)  has  been  used  as  an 
estimate  of  F(t). 

To  measure  the  filter,  a  full  scale  (14-bit)  1-ns  wide  digital  pulse  directly  generated  in  the  FPGA  chip  is  used  as  a 
trigger  source,  resulting  in  a  analog  pulse  at  the  DAC’s  output.  This  pulse,  as  shown  in  Fig.  3.2,  is  actually  the 
interpolating  pulse  shape  (filter)  corresponding  to  sampling  rate  of  1  Gs/s.  The  pulse  width  is  over  1  ns  and  there  is 
a  tail  behind  the  pulse,  implying  imperfect  circuit  coupling.  The  spectrum  of  the  interpolating  filter  is  depicted  in 
Fig.  3.3. 
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CHAPTER  3.  SYSTEM  TEST 


Transmitter 


i 


Receiver 

Figure  3.1:  Equivalent  filter  in  transmitter-receiver  chain  (not  including  the  propagation  channel),  contributed  from  three 
sources. 


Figure  3.2:  Measured  interpolating  pulse  shape. 


3. 1 .  DAC  INTERPOLATING  FILTER  MEASUREMENT 
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Figure  3.3:  The  spectrum  of  the  interpolating  filter. 
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Transmitter 


TX 


Confined  Metal  Box 


RX 


Receiver 


Furnitures,  equipments,  ... 


Office 


Figure  3.4:  Layout  for  measurement  in  metal  box. 


3.2  Performance  Measurement 


The  system  was  tested  in  a  typical  officc/lab,  non-line-of-sight  (NLOS)  environment.  The  layout  for  office/lab 
environment  is  illustrated  in  Fig.  3.4.  The  transmitter  and  the  receiver  are  set  in  one  room,  blocked  by  a  confined 
metallic  box.  The  height  of  both  antennas  is  about  1.5  m.  Separation  distance  between  the  two  antennas  is  not  a 
good  metric  since  the  path  loss  and  the  multipath  profile  do  not  solely  depend  upon  the  distance.  The  path  loss 
in  this  scenario  is  over  70  dB.  Two  30  dB  low-noise-amplifiers  (LNA)  are  used  to  compensate  the  path  loss  at  the 
transmitter  and  receiver,  respectively. 

We  first  use  vector  network  analyzer  (VNA)  to  measure  the  channel  and  get  the  Time  Reversal  I/Q  waveforms  via 
the  method  mentioned  in  the  January  report.  Fig.  3.5  shows  channel  transfer  function  (CTF)  and  Fig.  3.6  shows 
the  I/Q  modulation  waveforms.  Before  loading  the  trimmed  Time  Reversal  waveforms  into  FPGA,  we  load  the 
full-length  waveforms  to  the  DAC’s  waveform  memory  to  verify  the  validity  of  Time  Reversal.  If  a  focused  signal 
can  be  observed  at  the  receiver,  then  we  say  the  waveforms  are  correct.  It  can  be  easily  observed  that  only  the  Time 
Reversal  one  is  focused  in  time. 

Trimmed  waveforms  are  loaded  in  FPGA  to  conduct  the  system  trial.  Receiver  sensitivity  is  measured  in  this  trial. 
We  reduce  the  transmit  power  until  the  receiver  can  not  detect  any  signal.  Based  on  this  criterion,  it  is  found  that 
with  insignificant  ISI,  the  receiver  sensitivity  (measured  in  average  receive  power)  for  the  time  reversal  is  around  -81 
dBm.  The  transmit  spectra  of  Time  Reversal  system  and  impulse  system  are  depicted  in  Fig.  3.7  and  their  powers  are 
6.1  dBm  and  8.9  dBm  respectively.  The  receive  spectra  of  Time  Reversal  system  and  impulse  system  are  depicted  in 
Fig.  3.8  and  both  of  the  powers  are  -81  dBm.  As  a  result,  Time  Reversal  system  has  a  2.8  dB  gain  from  the  channel 
over  impulse  system. 


3.2.  PERFORMANCE  MEASUREMENT 
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Figure  3.5:  Channel  transfer  function,  measured  data  from  VNA 


Time  (ns) 


Figure  3.6:  Quantized  I/Q  modulation  waveforms  for  DAC.  Top  is  the  I  channel  waveform  and  bottom  is  the  Q 
channel  waveform 
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Figure  3.7:  Transmit  spectra  for  Time  Reversal  system  and  impulse  radio  system. 
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Frequency  (MHz) 


Figure  3.8:  Reeeive  speetra  for  Time  Reversal  system  and  impulse  radio  system. 


Chapter  4 


Current  and  Near-Future  Work 


Although  a  gain  contributed  by  time  reversal  precoding  has  been  achieved,  we  believe  there  are  still  some  room  to 
improve  the  hardware  implementation  and  the  performance.  Moreover,  we  have  started  to  design  a  new  test-bed 
with  compressive  sensing  capability. 


4.1  Synchronization  Optimization 

The  proposed  3-stage  synchronization  scheme  follows  proper  design  philosophy  for  burst  mode  communication.  We 
have  found  that  the  synchronization  performance  varies  depending  on  the  system  configuration.  There  are  a  number 
of  parameters  that  may  need  to  be  optimized.  However,  globally  optimizing  the  overall  synchronization  performance 
is  too  challenging.  A  practical  optimization  strategy  is  to  combine  pure  analysis,  measurement  and  simulation.  A 
detailed  methodology  needs  to  be  investigated  before  starting  the  optimization  process. 


4.2  Newer  DAC  and  ADC 


The  DAC  (MAX  108)  used  in  the  receiver  is  not  ideal  in  terms  of  sampling  rate  and  connection.  We  are  planning  to 
replace  it  with  some  newer  product  to  reach  a  sampling  rate  of  2  Gs/s  or  above.  Current  the  sampling  rate  is  only 
800  Ms/s,  which  might  cause  performance  degradation  since  the  time  reversal  pulse  width  at  the  detector’s  output 
is  only  6  ns  at  the  half  height.  If  the  sampling  rate  increases  to  2  Gs/s,  12  samples  can  be  effectively  captured  for 
each  received  pulse.  The  current  connection  between  the  ADC  and  the  FPGA  board  is  loose  and  the  individual  path 
lengths  for  different  bits  of  a  sample  are  not  accurately  aligned  mainly  due  to  physical  PCB  routing  limitation  in  the 
cable-connector  adapting  PCB. 

A  high  speed  ribbon  cable/bus  has  been  successfully  used  to  connect  the  DAC  and  the  FPGA  board  in  the  transmitter. 
However,  It  is  doubtable  for  this  type  of  cables  to  meet  our  higher  sampling  rate  requirement.  One  solution  to  the 
bottleneck  caused  by  connection  is  to  use  a  signal  processing  package  with  an  ADC  (or  a  DAC)  and  an  FPGA  chip 
integrated  on  a  single  board.  We  are  searching  such  type  of  boards  and  evaluating  their  performance  to  price  ratios. 
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4.3  Non-coherent  Threshold-less  Detection 


OOK  modulation  has  been  adopted  in  the  system  mainly  for  its  simplicity  for  non-coherent  detection.  The  number 
one  bothering  problem  in  the  system  with  OOK  modulation  and  non-coherent  detection  is  the  sensitivity  to  the 
decision  threshold.  An  optimum  threshold  depends  on  the  SNR  at  the  receiver,  and  it  requires  sophisticated  signal 
processing  to  achieve  the  optimum,  which  prevents  the  performance  from  being  the  optimum  practically. 

Per  above  discussion,  non-coherent  threshold-less  detection  is  extremely  desired.  One  solution  is  a  combination  of 
PPM  modulation  and  non-coherent  detection.  Shown  in  Fig.4.1  is  a  conceptual  receiver  block  diagram  for  PPM 
modulation  with  non-coherent  detection. 


SI -SO 

Figure  4. 1 :  Conceptual  receiver  block  diagram  for  PPM  modulation  with  non-coherent  detection 


4.4  2-bv-l  MISO 


A  3-dB  array  gam  can  be  anticipated  if  the  number  of  transmitter  branches  increases  from  one  to  two.  For  the 
passband  signaling  approach,  the  carriers  in  the  two  branches  must  be  strictly  aligned,  which  requires  identical 
DACs,  modulators  and  power  amplifiers  in  the  both  branches.  The  two  branches  can  share  a  common  local  oscillator 
(LO).  We  are  currently  searching  devices  for  this  matter. 


4.5  Direct-Conversion  Receiver 


To  explore  more  communication  techniques,  especially  those  implemented  at  receiver  side,  a  coherent  receiver 
will  provide  more  potential  than  a  non-coherent  receiver.  Among  all  coherent  receiver  architectures  is  a  direct- 
conversion  receiver  which  is  popular  nowadays.  This  type  of  receiver  is  also  called  homodyne,  synchrodyne  or 
zero-IF  receiver.  An  attractive  feature  of  a  direct-conversion  receiver  is  that  it  requires  less  analog  processing  but 
more  digital  processing.  This  feature  favors  our  design  philosophy  in  terms  of  flexibility,  thus  it  has  been  considered 
as  our  new  receiver.  In  UWB  band,  a  modulation-demodulation  scheme  to  test  on  this  receiver  might  be  transmitted 
reference  or  its  variants. 


4.6.  REAL-TIME  ERROR  RATE  MEASUREMENT 
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4.6  Real-Time  Error  Rate  Measurement 


So  far  we  have  tested  frame  loss  rate  (due  to  synchronization  failure)  and  bit  error  rate  in  a  non-real-time  way.  A 
real-time  error  meter  will  be  extremely  desired  since  it  can  serve  as  an  indicator  in  a  system  trial  and  make  the 
measurement  more  efficient  and  effective.  A  stand-alone  real-time  error  meter  mainly  consists  of  a  data  acquisition 
section  as  well  as  a  data  processing  section,  and  it  can  be  virtually  used  in  different  situations.  A  difficult  part  of  a 
stand-alone  error  meter  is  to  synchronize  the  meter  with  the  incoming  data  stream. 


22 


CHAPTER  4.  CURRENT  AND  NEAR-FUTURE  WORK 


4.7  Implementation  of  a  Compressed  Sensing  Based  Communication  System 


We  propose  an  implementable  Compressed  Sensing  (CS)  based  communication  system,  as  illustrated  in  Fig.  4.2. 
A  1  Ghz  UWB  signal  is  transmitted  in  the  10  GHz  carrier  and  captured  by  a  25  Msps  A/D.  This  system  utilizes 
the  concept  in  the  October  2008 ’s  report  but  with  smaller  bandwidth.  This  is  due  to  the  limitation  of  available  RF 
hardware.  The  baseband  signal  must  be  modulated  to  carrier  frequency  because  no  antennas  can  transmit  signal 
with  frequency  starting  from  DC.  Sparse  pulse  position  modulation  (PPM)  data  is  digitally  filtered  by  a  high  rate  PN 
filter  in  FPGA.  For  a  Xilinx  Virtex  5  FPGA,  the  rate  of  output  signal  can  be  up  to  3.2  Gsps  thanks  to  the  advanced 
RocketlO  technology.  The  circuity  in  FPGA  acts  as  a  virtual  interpolating  filter,  transforming  digital  signal  into 
analog  waveform.  Alternatively,  we  can  build  a  PN  sequence  generator  with  rate  up  to  even  80  Gsps  by  standard 
high  rate  shift  register.  However,  this  is  not  implementable  due  to  the  limitation  of  the  state-of-art  RF  technology.  To 
the  best  knowledge  of  the  author,  the  largest  bandwidth  supported  by  a  modulator  is  1  GHz,  provided  by  MITEQs 
SDM1015LI3CDQ  with  10  GHz  -  15  GHz  RF  output.  As  a  result,  the  output  signal  bandwidth  from  FPGA  should 
not  exceed  1  GHz.  Power  amplifier  (PA)  is  used  before  transmitting  the  UWB  signal  over  the  air.  The  structure  of 
the  receiver  is  very  simple.  There  is  no  down-conversion.  A  25  Msps  A/D  samples  the  data  directly  after  a  low  noise 
amplifier  (LNA)  and  passband  filter.  FPGA  is  used  to  acquire  the  A/D  data  and  process  with  recovery  algorithms 
such  as  OMP,  StOMP,  etc.. 

The  following  is  a  list  of  key  hardware  found  in  the  market  in  Fig.  4.2: 


•  FPGA:  Xilinx  Virtex  5  LX330T  with  3.25  Gsps  RocketlO 

•  Modulator:  M1TEQ  SDM1015LI3CDQ  with  1  GHz  modulation  bandwidth,  10  GHz  -  15  Ghz  RF  output 
frequency 

•  Local  Oscillator:  M1TEQ  LPLM 15000  with  8  GHz  -  15  GHz  output  frequency 

•  PA:  M1TEQ  AMF-6B-08001800-60-34P  with  8  GHz  -  18  GHz  frequency  range,  37  dB  gain 

•  LNA:  MITEQ  AMFW-5S- 1000 1500-250  with  10  GHz  -  15  GHz  frequency  range,  30  dB  gain 


■  UWB  Si<K*l  wl«  1  -GNj 


Figure  4.2:  Architecture  of  a  proposed  compressed  sensing  based  communication  system. 


Part  II 

Theoretical  Work 
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Wideband  Waveform  Optimization  with 
Energy  Detector  Receiver  in  Cognitive  Radio 
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5.1  Introduction 


Waveform  design  or  optimization  is  a  key  research  issue  in  the  current  wireless  communication  system.  Waveform 
should  be  designed  according  to  the  different  requirements  and  objectives  of  system  performance.  For  example,  the 
waveform  should  be  designed  to  carry  more  information  to  the  receiver  in  terms  of  capacity.  For  navigation  and 
geolocation,  the  ultra  short  waveform  should  be  used  to  increase  the  resolution.  If  the  energy  detector  is  employed 
at  the  receiver,  the  waveform  should  be  optimized  such  that  the  energy  of  the  signal  within  integration  window  at 
the  receiver  should  be  maximized.  In  the  context  of  cognitive  radio,  waveform  design  or  optimization  gives  us  more 
flexibility  to  design  a  radio  which  can  coexist  with  other  cognitive  radios  and  primary  radios.  From  cognitive  radio's 
point  of  view,  spectral  mask  constraint  at  the  transmitter  and  the  influence  of  Arbitrary  Notch  Filter  at  the  receiver 
should  be  seriously  considered  for  waveform  design  or  optimization,  except  the  consideration  of  the  traditional  com¬ 
munication  objectives.  Spectral  mask  constraint  is  imposed  on  the  transmitted  waveform  such  that  cognitive  radio 
has  no  interference  to  primary  radio.  While  Arbitrary  Notch  Filter  at  the  receiver  is  used  to  cancel  the  interference 
from  primary  radio  to  cognitive  radio.  Though  Arbitrary  Notch  Filter  is  at  the  receiver,  the  influence  of  Arbitrary 
Notch  Filter  should  be  incorporated  into  waveform  design  at  the  transmitter.  How  to  implement  Arbitrary  Notch 
Filter  is  out  of  scope  of  this  chapter. 

The  optimization  problem  is  formulated  as  Quadratically  Constrained  Quadratic  Program  (QCQP),  which  is  known 
as  an  NP-hard  problem.  However,  after  relaxation,  QCQP  can  be  solved  by  using  Semidefinite  Program  (SDP)  plus 
a  randomized  algorithm.  In  addition  to  SDR  a  computationally-efficient  iterative  method  is  proposed  to  give  the 
optimal  solution  for  a  special  situation  where  the  integrator  only  captures  one  sample  after  ADC  as  the  decision 
statistic. 


5.2  System  Description  and  Optimal  Waveform 


The  system  architecture  is  shown  in  Figure  5.1.  We  limit  our  discussion  to  a  single-user  scenario,  and  consider  the 
transmitted  signal  with  OOK  modulation  given  by 

oc 

s(t)  =  djp(t  —  jTb)  (5.1) 

j=—o O 

where  TJ,  is  the  symbol  duration,  p(t)  is  the  transmitted  symbol  waveform  defined  over  [0,  Tp\  and  dj  €  {0, 1}  is  the 
j-th  transmitted  bit.  Without  loss  of  generality,  assume  the  minimal  propagation  delay  is  equal  to  zero.  The  energy 
of  p(t)  is  Ep , 

[  p2(t)dt  =  Ep  (5.2) 

Jo 

and  the  frequency  domain  representation  of  p(t)  is  p/(/).  Thus, 

/oo 

\Pf(f)\2df  =  Ep  (5.3) 

-oo 

The  received  noise-polluted  signal  at  the  output  of  low  noise  amplifier  (LNA)  is 

r(t)  —  h(t)  <8>  s  (t)  +n  (t)  (5.4) 

oo 

=  ^2  djx  (t  -  iTb)  +  n  (t) 

j— — OO 


(5.5) 


5.2.  SYSTEM  DESCRIPTION  AND  OPTIMAL  WAVEFORM 


27 


Figure  5.1:  System  architecture. 

where  h(t)  ,t  £  [0,  7/J  is  the  multipath  impulse  response  that  takes  into  account  the  effect  of  channel  impulse 
response,  the  RF  front-ends  in  the  transceivers  such  as  Power  Amplifier,  LNA  and  Arbitrary  Notch  Filter  as  well  as 
antennas.  h(t)  is  available  at  the  transmitter  [1]  [2]  and  the  energy  of  h(t)  is  Ek, 

h2(t)dt  =  Eh  (5.6) 

and  the  frequency  domain  representation  of  h(t)  is  hf(f).  Thus, 

/oo 

\hf(f)\2df  =  Eh  (5.7) 

-OO 

“0”  denotes  convolution  operation.  r i(t)  is  a  low-pass  additive  zero-mean  Gaussian  noise  with  one-sided  bandwidth 
W  and  one-sided  power  spectral  density  No-  x(t)  is  the  received  noiseless  symbol-“l”  waveform  defined  as 

x(t)  =  h(t)  ®p(t)  (5.8) 


We  further  assume  that  Tb  >  A  Tp  =  Tx,  i.e.  no  existence  of  ISI. 

An  energy  detector  performs  square  operation  to  r(t).  Then  the  integrator  does  the  integration  over  a  given  integra¬ 
tion  window  T/.  Corresponding  to  the  time  index  k ,  the  k- th  decision  statistic  at  the  output  of  the  integrator  is  given 
by 


zk  = 


L 

L 


kTb+Tio+T] 


kT^+Tjo 

ArTb+T/o+T/ 


kTb+Tio 


r2(t)dt 

(dkx(t  —  kTb)  +  n{t))2dt 


(5.9) 

(5.10) 


where  T/o  is  the  starting  time  of  integration  for  each  symbol  and  0  <  T/0  <  T/o  T-  Tj  <  Tx  <  Tt>. 

An  approximately  equivalent  SNR  for  the  energy  detector  receiver,  which  provides  the  same  detection  performance 
when  applied  to  a  coherent  receiver,  is  given  as  [3] 


SNReq  = 


2(g+T/*2(^)2 

2.3T/W7V02  +  No  x 2  (t)  dt 


(5.11) 
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For  best  performance,  the  equivalent  SNR  SNReq  should  be  maximized.  Define, 

rTjo+Ti 

E,=  x2  (t)  dt  (5.12) 

JT,  o 

For  given  T/,  No  and  W ,  SNReq  is  the  increasing  function  of  Ej.  So  the  maximization  of  SNReq  in  Equation  5.11 
is  equivalent  to  the  maximization  of  E\  in  Equation  5.12. 


So  the  optimization  problem  to  get  the  optimal  p(t )  is  shown  below, 

max  Jt™+Ti  x2  ( t )  dt 

s.t.  JqPp2  ( t)dt  -  Ep 


(5.13) 


In  order  to  solve  the  optimization  problem  5.13,  a  numerical  approach  is  employed  in  this  document.  In  other  words, 
p(t ),  h(t)  and  x(t)  are  uniformly  sampled  (assumed  at  Nyquist  rate),  and  the  optimization  problem  5.13  will  be 
converted  to  its  corresponding  diserete-time  form.  Assume  the  sampling  period  is  Ts.  Tp/Ts  =  Np  and  Np  is 
assumed  to  be  even,  T^/Ts  =  and  Tx/Ta  =  Nx.  So  Nx  =  Np  + 

p(t)y  h(t)  and  x(t)  are  represented  by  px,i  =  0, 1, ... ,  Np ,  /zz,  i  =  0, 1, . . . ,  and  x*,  i  =  0, 1, . . . ,  Nx  respec¬ 
tively  [3], 


Define, 


and 


P  =  bo  Pl  ■■■  PA'p]7 

(5.14) 

X  =  [xq  X\  ■■■  xNz}t 

(5.15) 

Construct  channel  matrix  H 


(Nx  +  \)x(Np+\)i 


( 


H)  =  / 

t±hj  \  o, 


-j,  0<i-j<Nh 


else 


(5.16) 


where  (•)i  -  denotes  the  entry  in  the  z-th  row  and  j-th  eolumn  of  the  matrix  or  vector.  Meanwhile,  for  veetor,  taking 
p  as  an  example,  (p)^  l  is  equivalent  to  pz-\. 


The  matrix  expression  of  Equation  5.8  is, 

x  =  Hp 

and  the  constraint  in  the  optimization  problem  5.13  can  be  expressed  as, 

llplll  Ts  =  Ep 


(5.17) 


(5.18) 


where  “II  •  |L”  denotes  the  norm-2  of  the  vector.  In  order  to  make  the  whole  document  consistent,  we  further  assume, 


ipiil  =  i 


(5.19) 


Let  Tj/Ts  =  Nj  and  Tjq/Ts  =  Nj$.  The  entries  in  x  within  integration  window  constitute  x/  as, 

X/  =  [^iV/o  XNfo  +  l  *  *  *  ZNjq  +  Nj  ] 


(5.20) 
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and  Ej  in  Equation  5.12  can  be  equivalently  shown  as, 


Ei  =  ||x/  H2  T$ 


(5.21) 


Simply  dropping  Ts  in  E j  will  not  affect  the  optimization  objective,  so  Ej  is  redefined  as, 


El  =  Mil 


(5.22) 


Similar  to  Equation  5.17,  x/  can  be  obtained  by, 

X/  =  H/P 

where  =  (H)W;fl+ij.  and  i  =  1, 2 - ,  Nj  +  1  as  well  as  j  =  1, 2 - -  Np  +  1. 


(5.23) 


The  optimization  problem  5.13  can  be  represented  by  its  discrete-time  form  as, 


The  optimal  solution  p*  for  the  optimization  problem  5.24  is  the  dominant  eigen-vector  in  the  following  eigen¬ 
function  [3], 

H/  H/p  =  Ap  (5.25) 

Furthermore,  E*j  will  be  obtained  by  Equation  5.22  and  Equation  5.23. 

5.3  Wideband  Waveform  Design  in  Cognitive  Radio 

For  cognitive  radio,  there  is  a  spectral  mask  constraint  for  the  transmitted  waveform.  Based  on  the  previous  discus¬ 
sion,  p  is  assumed  to  be  the  transmitted  waveform  and  F  is  the  Fourier  transform  matrix,  thus  the  frequency  domain 
representation  of  p  is, 

P /  -  Fp  (5.26) 

where  py  is  a  complex  value  vector.  If  the  i-th  row  of  F  is  f?,  then  each  complex  value  in  py  can  be  represented  by, 


(5.27) 


Thus  the  modulus  of  the  complex  value  is  |«|. 


Define, 


(5.28) 


r  iT 

Given  the  spectral  mask  constraint  in  terms  of  power  spectral  density  c  =  C\  C2  •  •  •  ,  so 

L  o  •  E 


2 


(5.29) 

(5.30) 

(5.31) 

(5.32) 


2 
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So  the  optimization  problem  with  spectral  mask  constraint  can  be  expressed  as, 

max  £7 


s.t. 


Ml  <  1 

->;Fip  <  ci 


(5.33) 


pJF^+1p<c^+1 

Define, 

H„  =  H/H/  (5.34) 

Equation  5.22,  Equation  5.23  and  Equation  5.34  are  put  into  the  optimization  problem  5.33,  and  then  this  optimiza¬ 
tion  problem  will  be  formulated  as, 

maxp1  H0p 

p7  p  <  1 

PTFip  <  ci  (5.35) 

PJF^+1p<c^+1 


In  the  optimization  problem  5.35,  both  the  objective  function  and  the  constraints  are  quadratic  functions,  so  this  kind 
of  optimization  problem  is  a  QCQP.  The  general  QCQP  is  a  NP-hard  problem.  A  semidefinite  relaxation  method  is 
used  in  this  document  to  give  the  suboptimal  solution  to  the  optimization  problem  5.35. 

Define, 

P  =  ppr  (5.36) 

So  P  should  be  the  symmetric  positive  semidefinite  matrix,  i.e.  P  >-=  0  and  rank  of  P  should  be  equal  to  1. 


Then, 


PTHoP 

(5.37) 

trace  (Hopp7 ) 

(5.38) 

trace  (H0P) 

(5.39) 

and  ||p||2  =  trace  (P).  p^  Fzp  =  trace  (F?P) ,  i  =  1, 2, . . . ,  +  1. 


Rank  constraint  is  nonconvex  constraint,  so  after  dropping  it,  QCQP  is  relaxed  to  SDP, 

max  trace  (H0P) 

trace  (P)  <  1 
trace  (FiP)  <  Ci 

S.t.  ; 

trace  (F^+iP )  <c^+1 


(5.40) 


The  optimal  solution  P*  of  the  optimization  problem  5.40  can  be  obtained  by  using  CVX  tool  [4]  and  the  value  of 
the  objective  function  gives  the  upper  bound  of  the  optimal  value  in  the  optimization  problem  5.35.  If  the  rank  of 
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P*  is  equal  to  1,  then  the  dominant  eigen-vector  of  P*  will  be  the  optimal  solution  p*  for  the  optimization  problem 
5.35.  But  if  the  rank  of  P*  is  not  equal  to  1,  the  dominant  eigen-vector  of  P*  can  not  be  treated  as  the  optimal 
solution  for  the  optimization  problem  5.35,  because  of  the  violation  of  bound  constraint. 

An  efficient  randomized  algorithm  is  given  by  [5]  [6]  to  obtain  the  feasible  solution  p*  from  P*. 

This  algorithm  will  be  repeated  a  sufficient  number  of  times  to  get  the  relatively  good  feasible  p*  as  the  suboptimal 
solution  to  the  optimization  problem  5.35. 

When  Tj  approaches  zero,  which  means  the  integrator  only  captures  one  sample  after  ADC  as  the  decision  statistic, 
the  maximization  of  Ej  will  be  equivalent  to  the  maximization  of  x  (T/ o).  An  iterative  algorithm  is  proposed  here 
to  give  the  solution  to  the  maximization  of  x  (T/o),  which  will  be  more  computationally  efficient  than  the  previous 
semidefinite  relaxation  method.  For  the  simplicity  of  the  following  presentation,  T/o  is  assumed  to  be  zero. 


From  the  inverse  Fourier  transform, 


and  from  Equation  5.8, 


/oo 

xfU)ej2*}tdf 

-oo 


xf(f)  =  hf(f)Pf(f) 

where  Xf  (/)  is  the  the  frequency  domain  representation  of  x(£)  respectively. 

So, 

/oo 

xf  (/)  df 

-OO 

/OO 

hf  (/)  Pf  (/)  df 

-OO 

If  there  is  no  spectral  mask  constraint,  then  according  to  the  Cauchy— Schwarz  inequality, 

/oo 

hf  (/)  Pf  (/)  df 

-oo 


(5.41) 

(5.42) 


(5.43) 

(5.44) 


< 


I  roc  Too 

\  \h/(f)\2df  |p/(/)|5 

y  J — oo  J  — oo 


df 


y/  EhEp 


(5.45) 

(5.46) 

(5.47) 


when  pf  (/)  =  ahj  (/)  for  all  /,  the  equality  is  obtained.  In  this  case,  p(t)  =  ah(—t),  which  means  the  optimal 
waveform  is  the  time  reversed  multipath  impulse  response.  And, 


a 


(5.48) 


If  there  is  a  spectral  mask  constraint,  then  the  following  optimization  problem  will  become  more  complicated. 

max  x  (0) 

s-t-  J-oc\Pf  (f)\2  df  <  1 

I  Pf  (/)|2  <  Cf(f) 


(5.49) 
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where  cj  (/)  represents  the  arbitrary  spectral  mask  constraint. 

Because  pj(f)  is  the  complex  value,  the  phase  and  the  modulus  of  p/(f)  should  be  determined. 


/oo 

V  (/)  Pf  (/)  df 

-oo 

-  r  I  hf  (/)|  eJ2’rar*(fe/(/))  |  pf  (/)  |  eV*MPfU))df 

J — OO 

/oo 

|/ly  (/)|  |P/  (/)|  eiMMhfU))+™s(PfV)))df 

■oo 

where  the  angular  component  of  the  complex  value  is  arg  (•). 

For  the  real  value  signal  x(t ), 


Xf  (/)  =  2 ;}(-/) 


where  denotes  conjugate  operation. 


*/(/)  =  IV  (/)|  Ip/  (/)|  eMMhfU))+^(pfU))) 

xf  (-/)  =  IV  (/)|  \pf  (f)\ 
and  x j(f)  -F  /)  is  equal  to 

IV  (/)l  I  Pf  (/)|  cos(2tt  (arg  (hf  (/))  +  arg  ( pf  (/)))) 


If  h(f)  and  (/)|  is  given,  maximization  of  x  (0)  is  equivalent  to  setting, 

arg  {h  (/))  +  arg  (p  (/))  =0 

which  means  the  angular  component  of  pj  (/)  is  the  negative  angular  component  of  hj  (/). 
The  optimization  problem  5.49  can  be  simplified  as, 

max/^IM/)lb/(/)K 

Ip/  U)\2df<  l 

Ip/(/)I2<C/  (/),/>  o 


(5.50) 

(5.51) 

(5.52) 


(5.53) 

(5.54) 

(5.55) 

(5.56) 

(5.57) 


(5.58) 


Because, 

IV(/)I  =  IV(-/)I  (5-59) 

|P/(/)l  =  |P/(-/)l  (5-60) 

M/)I  =  M-/)|  (5.61) 

for  all  /.  Thus  uniformly  discrete  frequency  points  /q,  . . .,  f\j  are  considered  in  the  optimization  problem  5.58. 
Meanwhile,  /o  corresponds  to  the  DC  component  and  f\, . . correspond  to  the  positive  frequency  component. 


IV  (/«-i)M  = 1 

n/2  |V  (/i-i)|,t  =  2,...,^  +  l 


Define  vector  hf. 


(5.62) 
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Define  vector  p^, 

f  \Pf  (/i-l)l  1  i  —  1 

1  n/2  \pf  (/i-i)|  ,i  =  2, . . . ,  ^  +  1 


I  y/cf(fi- 1),  i  =  1 

1  \/2c/  (/i-i),  t  =  2, . . . ,  4^  +  1 


Thus,  the  discrete  version  of  the  optimization  problem  5.58  is  shown  below, 

max  h/p/ 

S.t.  P/H2  ^  1 
Pf<Cf 


(P/)«M 


Define  vector  c y 


(rA 


(5.63) 


(5.64) 


(5.65) 


So  an  iterative  algorithm  is  proposed  to  get  the  optimal  solution  to  the  optimization  problem  5.65  as  follows. 
1  Initialization:  P  =  1  and  is  set  to  be  an  all-0  column  vector. 

2.  Solve  the  following  optimization  problem  to  get  the  optimal  using  Cauchy— Schwarz  inequality. 


max 

s.t.  II q/ II 2  <  P 


(5.66) 


i  >  (c/)j,i  j-  If  {*}  =  then  the 

algorithm  is  terminated  and  p^  :=  p^  +  q^.  Otherwise  go  to  step  4. 

4'  Set  (P/)M  = 

5.  P  P  -  (c/)?j  and  set  (hf)i  {  to  zero.  Go  to  step  2. 


3.  Find  i ,  such  that 


i,  1 


is  the  maximal  value  in  the  set 


When  pj  is  obtained  for  the  optimization  problem  5.65,  from  Equation  5.57  and  Equation  5.63,  the  optimal  p/(f) 
and  the  corresponding  p(t )  can  be  smoothly  achieved. 


The  proposed  iterative  algorithm  gives  the  optimal  solution  to  the  optimization  problem  5.65,  which  can  be  proofed 
as  follows. 


Given  column  vector  a,  all  the  entries  a*, . . . ,  a/v  in  a  are  positive.  Meanwhile,  we  assume  a\  >  max  {(12,  •  .  •  ,ajv}- 
We  would  like  to  solve  the  following  optimization  problem  to  get  the  optimal  column  vector  b, 


max  a7  b 
s.t. b7b  <  P 
b\  <  c 


c  < 


aiVP 

N 
i= 1 


(5.67) 
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For  optimal  solution  b*,  if  b\  —  c ,  then  the  optimum  of  the  iterative  algorithm  is  achieved.  Set  b\  =  ac,  where 
0  <  a  <  1,  then, 


N 

a]b  = 

2=1 

N 

=  a\b\  +  ^  aibt 
2=2 


< 


aa\c  + 


\ 


N 


Ea?vp 


2=2 


(5.68) 

(5.69) 

(5.70) 


'  N 


Let  /  (a)  =  aa\c  +  \j  ^2  ~  the  optimization  problem  is  reduced  to, 

V  2= 2 


max  /  (a) 
s.t. 0  <  a  <  1 


(5.71) 


From  /  (a),  we  can  get  jL  (a), 


Q\C 


/  N 

“2y 

2=2 

v  P  —  c*2c2 


(5.72) 


Let  £  (q)  is  equal  to  zero,  then  a  is  equal  to  ±  aiA/^>  .  Because  of  the  constraint  on  c,  |a|  is  greater  than  1. 

t-1 

Meanwhile,  (0)  =  a\c9  which  means  (a)  keeps  positive  value  in  [0, 1].  Thus  /  (a)  is  a  monotone  increasing 

function  in  [0, 1  .  arg  max  /  (a)  =  1.  So,  bi  =  c.  The  optimum  of  the  iterative  algorithm  is  verified. 

<*€[0,1] 


5.4  Numerical  Results 


Figure  5.2  shows  the  multipath  impulse  response  h(t)  under  investigation  in  this  document  and  the  energy  of  h(t) 
is  normalized.  Figure  5.3  shows  the  frequency  domain  representation  hf(f)  of  h(t).  Green  notch  line  in  Figure 
5.3  shows  the  influence  of  Arbitrary  Notch  Filter  at  the  receiver.  Ts  =  0.5ns,  Th  =  100ns,  Tp  =  100ns  and 
T/o  +  =  100ns.  In  the  following  presentation,  the  designed  waveform,  which  is  obtained  by  SDP  and  the  rank  of 

P*  to  the  optimization  problem  5.40  is  not  equal  to  1,  is  called  Waveform  SDP  Unverified;  the  designed  waveform, 
which  is  obtained  by  SDP  and  the  rank  of  P*  to  the  optimization  problem  5.40  is  equal  to  1  or  which  is  obtained 
by  SDP  plus  a  randomized  algorithm,  is  called  Waveform  SDP;  the  designed  waveform,  which  is  obtained  by  an 
iterative  algorithm,  is  called  Waveform  by  Iterative  Algorithm. 

If  T/  — >  0,  which  means  the  integrator  only  captures  one  sample  after  ADC  as  the  decision  statistic,  then  spectral 
mask,  notch  line  of  Arbitrary  Notch  Filter,  the  designed  waveforms  and  multipath  impulse  response  represented 
in  the  frequency  domain  are  shown  in  Figure  5.4.  Meanwhile,  the  designed  waveforms  in  time  domain  are  shown 
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Figure  5.2:  Multhpath  impulse  response. 


Figure  5.3:  Multipath  impulse  response  and  notch  line  of  Arbitrary  Notch  Filter  represented  in  the  frequency  domain. 

in  Figure  5.5.  Figure  5.6  shows  the  convolution  between  the  designed  waveforms  and  multipath  channel  impulse 
response.  In  this  case  spectral  mask  constraint  is  not  tight.  The  peak  value  of  noiseless  signal  in  Figure  5.6  is  I. 
If  the  arbitrary  spectral  mask  constraint  is  imposed  on  the  transmitted  waveform,  then  spectral  mask,  notch  line  of 
Arbitrary  Notch  Filter,  the  designed  waveforms  and  multipath  impulse  response  represented  in  the  frequency  domain 
are  shown  in  Figure  5.7.  Meanwhile,  the  designed  waveforms  in  time  domain  are  shown  in  Figure  5.8.  Figure  5.9 
shows  the  convolution  between  the  designed  waveforms  and  multipath  channel  impulse  response.  The  peak  value  of 
the  noiseless  signal  in  Figure  5.9  is  0.94.  This  sharp  peak  can  give  us  benefits  for  wireless  communication. 

In  sum,  when  Tj  — >  0,  the  waveform  by  an  iterative  algorithm  is  the  same  as  the  waveform  by  SDP.  SDP  always  gives 
the  optimal  solution  to  the  optimization  problem  5.35,  which  means  the  rank  of  P*  to  the  optimization  problem  5.40 
is  always  equal  to  1.  The  reason  of  this  result  is  that  in  this  kind  of  situation  the  rank  of  H0  is  equal  to  1.  Meanwhile, 
the  transmitted  waveform  satisfies  the  spectral  mask  constraint,  which  will  make  cognitive  radio  have  no  interference 
to  primary  radio.  Besides,  cognitive  radio  will  transmit  the  waveform  such  that  the  energy  will  not  be  transmitted  in 
the  notched  frequency  band  at  the  receiver  in  order  to  save  the  energy.  Thus,  the  designed  waveform  can  be  treated 
as  a  wideband  non-continuity  signal. 

If  T[  is  equal  to  20ns,  then  two  different  spectral  masks  are  considered.  One  spectral  mask  and  the  corresponding 
notch  line  of  Arbitrary  Notch  Filter,  the  designed  waveforms  and  multipath  impulse  response  represented  in  the 
frequency  domain  are  shown  in  Figure  5.10.  The  other  is  shown  in  Figure  5. 1 1 .  When  X/  is  larger  than  0,  only  SDP 
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Figure  5.4:  Spectral  mask,  notch  line  of  Arbitrary  Notch  Filter,  the  designed  waveforms  and  multipath  impulse 
response  represented  in  the  frequency  domain  when  Tj  —*  0. 


Figure  5.5:  Designed  waveforms  in  the  time  domain  when  Tj  — >  0. 

can  not  guarantee  the  optimal  solution  at  all  times  because  the  rank  of  P*  to  the  optimization  problem  5.40  can  not 
be  always  equal  to  1.  In  the  first  case,  only  SDP  gives  the  optimal  solution.  But  in  the  second  case,  the  waveform 
obtained  only  by  SDP  violates  the  spectral  mask  constraint  in  some  frequency  points,  and  SDP  plus  a  randomized 
algorithm  gives  the  suboptimal  solution. 


5.5  Discussion 


This  chapter  deals  with  wideband  waveform  optimization  with  energy  detector  in  cognitive  radio.  Wideband  wave¬ 
form  is  designed  according  to  the  optimization  objective  with  the  considerations  of  the  spectral  mask  constraint 
at  the  transmitter  and  the  influence  of  Arbitrary  Notch  Filter  at  the  receiver.  The  optimization  problem  is  formu¬ 
lated  as  QCQP.  After  relaxation,  QCQP  can  be  solved  by  using  SDP  plus  a  randomized  algorithm.  Meanwhile,  a 
computationally-efficient  iterative  method  is  proposed  to  give  the  optimal  solution  for  a  special  situation  where  the 
integrator  only  captures  one  sample  after  ADC  as  the  decision  statistic. 


5.5.  DISCUSSION 


37 


Figure  5.6:  Convolution  between  the  designed  waveforms  and  the  multipath  impulse  response  when  Tj  — ►  0. 


Figure  5.7:  Spectral  mask,  notch  line  of  Arbitrary  Notch  Filter,  the  designed  waveforms  and  multipath  impulse 
response  represented  in  the  frequency  domain  when  Tj  — >  0. 


Figure  5.8:  Designed  waveforms  in  the  time  domain  when  Tj  — ►  0. 
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Figure  5.9:  Convolution  between  the  designed  waveforms  and  the  multipath  impulse  response  when  T/  —>  0. 


Figure  5.10:  Spectral  mask,  notch  line  of  Arbitrary  Notch  Filter,  the  designed  waveforms  and  multipath  impulse 
response  represented  in  the  frequency  domain  when  Tj  is  20ns. 


Figure  5.1 1:  Spectral  mask,  notch  line  of  Arbitrary  Notch  Filter,  the  designed  waveforms  and  multipath  impulse 
response  represented  in  the  frequency  domain  when  Tj  is  20ns. 
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